47 research outputs found

    GraphArchive - An Online Graph Data Store

    Get PDF
    In this report, we present our approach 'GraphArchive'. The solution attempts to enable researchers to exchange and archive graphs. The software is developed as an online platform using modern web technologies. In the document, features and architecture of GraphArchive are presented and the former approach 'GraphDB' is compared to the new system. Also, reader are taken on a typical walk through the system using a common use case for GraphArchive. News and development status of the system can be also visited at http://www.graph-archive.org

    The Open Graph Archive: A Community-Driven Effort

    Full text link
    In order to evaluate, compare, and tune graph algorithms, experiments on well designed benchmark sets have to be performed. Together with the goal of reproducibility of experimental results, this creates a demand for a public archive to gather and store graph instances. Such an archive would ideally allow annotation of instances or sets of graphs with additional information like graph properties and references to the respective experiments and results. Here we examine the requirements, and introduce a new community project with the aim of producing an easily accessible library of graphs. Through successful community involvement, it is expected that the archive will contain a representative selection of both real-world and generated graph instances, covering significant application areas as well as interesting classes of graphs.Comment: 10 page

    Short sequence motifs, overrepresented in mammalian conserved non-coding sequences

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>A substantial fraction of non-coding DNA sequences of multicellular eukaryotes is under selective constraint. In particular, ~5% of the human genome consists of conserved non-coding sequences (CNSs). CNSs differ from other genomic sequences in their nucleotide composition and must play important functional roles, which mostly remain obscure.</p> <p>Results</p> <p>We investigated relative abundances of short sequence motifs in all human CNSs present in the human/mouse whole-genome alignments <it>vs</it>. three background sets of sequences: (i) weakly conserved or unconserved non-coding sequences (non-CNSs); (ii) near-promoter sequences (located between nucleotides -500 and -1500, relative to a start of transcription); and (iii) random sequences with the same nucleotide composition as that of CNSs. When compared to non-CNSs and near-promoter sequences, CNSs possess an excess of AT-rich motifs, often containing runs of identical nucleotides. In contrast, when compared to random sequences, CNSs contain an excess of GC-rich motifs which, however, lack CpG dinucleotides. Thus, abundance of short sequence motifs in human CNSs, taken as a whole, is mostly determined by their overall compositional properties and not by overrepresentation of any specific short motifs. These properties are: (i) high AT-content of CNSs, (ii) a tendency, probably due to context-dependent mutation, of A's and T's to clump, (iii) presence of short GC-rich regions, and (iv) avoidance of CpG contexts, due to their hypermutability. Only a small number of short motifs, overrepresented in all human CNSs are similar to binding sites of transcription factors from the FOX family.</p> <p>Conclusion</p> <p>Human CNSs as a whole appear to be too broad a class of sequences to possess strong footprints of any short sequence-specific functions. Such footprints should be studied at the level of functional subclasses of CNSs, such as those which flank genes with a particular pattern of expression. Overall properties of CNSs are affected by patterns in mutation, suggesting that selection which causes their conservation is not always very strong.</p

    Walking pathways with positive feedback loops reveal DNA methylation

    Get PDF
    Background: the search for molecular biomarkers of early-onset colorectal cancer (CRC) is an important but still quite challenging and unsolved task. Detection of CpG methylation in human DNA obtained from blood or stool has been proposed as a promising approach to a noninvasive early diagnosis of CRC. Thousands of abnormally methylated CpG positions in CRC genomes are often located in non-coding parts of genes. Novel bioinformatic methods are thus urgently needed for multi-omics data analysis to reveal causative biomarkers with a potential driver role in early stages of cancer. Methods: we have developed a method for finding potential causal relationships between epigenetic changes (DNA methylations) in gene regulatory regions that affect transcription factor binding sites (TFBS) and gene expression changes. This method also considers the topology of the involved signal transduction pathways and searches for positive feedback loops that may cause the carcinogenic aberrations in gene expression. We call this method 'Walking pathways', since it searches for potential rewiring mechanisms in cancer pathways due to dynamic changes in the DNA methylation status of important gene regulatory regions ('epigenomic walking'). Results: in this paper, we analysed an extensive collection of full genome gene-expression data (RNA-seq) and DNA methylation data of genomic CpG islands (using Illumina methylation arrays) generated from a sample of tumor and normal gut epithelial tissues of 300 patients with colorectal cancer (at different stages of the disease) (data generated in the EU-supported SysCol project). Identification of potential epigenetic biomarkers of DNA methylation was performed using the fully automatic multi-omics analysis web service 'My Genome Enhancer' (MGE) (my-genome-enhancer.com). MGE uses the database on gene regulation TRANSFAC®, the signal transduction pathways database TRANSPATH®, and software that employs AI (artificial intelligence) methods for the analysis of cancer-specific enhancers. Conclusions: the identified biomarkers underwent experimental testing on an independent set of blood samples from patients with colorectal cancer. As a result, using advanced methods of statistics and machine learning, a minimum set of 6 biomarkers was selected, which together achieve the best cancer detection potential. The markers include hypermethylated positions in regulatory regions of the following genes: CALCA, ENO1, MYC, PDX1, TCF7, ZNF43

    Role of Phagocytosis in the Pro-Inflammatory Response in LDL-Induced Foam Cell Formation; a Transcriptome Analysis

    Get PDF
    Excessive accumulation of lipid inclusions in the arterial wall cells (foam cell formation) caused by modified low-density lipoprotein (LDL) is the earliest and most noticeable manifestation of atherosclerosis. The mechanisms of foam cell formation are not fully understood and can involve altered lipid uptake, impaired lipid metabolism, or both. Recently, we have identified the top 10 master regulators that were involved in the accumulation of cholesterol in cultured macrophages induced by the incubation with modified LDL. It was found that most of the identified master regulators were related to the regulation of the inflammatory immune response, but not to lipid metabolism. A possible explanation for this unexpected result is a stimulation of the phagocytic activity of macrophages by modified LDL particle associates that have a relatively large size. In the current study, we investigated gene regulation in macrophages using transcriptome analysis to test the hypothesis that the primary event occurring upon the interaction of modified LDL and macrophages is the stimulation of phagocytosis, which subsequently triggers the pro-inflammatory immune response. We identified genes that were up- or downregulated following the exposure of cultured cells to modified LDL or latex beads (inert phagocytosis stimulators). Most of the identified master regulators were involved in the innate immune response, and some of them were encoding major pro-inflammatory proteins. The obtained results indicated that pro-inflammatory response to phagocytosis stimulation precedes the accumulation of intracellular lipids and possibly contributes to the formation of foam cells. In this way, the currently recognized hypothesis that the accumulation of lipids triggers the pro-inflammatory response was not confirmed. Comparative analysis of master regulators revealed similarities in the genetic regulation of the interaction of macrophages with naturally occurring LDL and desialylated LDL. Oxidized and desialylated LDL affected a different spectrum of genes than naturally occurring LDL. These observations suggest that desialylation is the most important modification of LDL occurring in vivo. Thus, modified LDL caused the gene regulation characteristic of the stimulation of phagocytosis. Additionally, the knock-down effect of five master regulators, such as IL15, EIF2AK3, F2RL1, TSPYL2, and ANXA1, on intracellular lipid accumulation was tested. We knocked down these genes in primary macrophages derived from human monocytes. The addition of atherogenic naturally occurring LDL caused a significant accumulation of cholesterol in the control cells. The knock-down of the EIF2AK3 and IL15 genes completely prevented cholesterol accumulation in cultured macrophages. The knock-down of the ANXA1 gene caused a further decrease in cholesterol content in cultured macrophages. At the same time, knock-down of F2RL1 and TSPYL2 did not cause an effect. The results obtained allowed us to explain in which way the inflammatory response and the accumulation of cholesterol are related confirming our hypothesis of atherogenesis development based on the following viewpoints: LDL particles undergo atherogenic modifications that, in turn, accompanied by the formation of self-associates; large LDL associates stimulate phagocytosis; as a result of phagocytosis stimulation, pro-inflammatory molecules are secreted; these molecules cause or at least contribute to the accumulation of intracellular cholesterol. Therefore, it became obvious that the primary event in this sequence is not the accumulation of cholesterol but an inflammatory response

    Comparing nuclear power trajectories in Germany and the UK: from ‘regimes' to ‘democracies’ in sociotechnical transitions and Discontinuities

    Get PDF
    This paper focuses on arguably the single most striking contrast in contemporary major energy politics in Europe (and even the developed world as a whole): the starkly differing civil nuclear policies of Germany and the UK. Germany is seeking entirely to phase out nuclear power by 2022. Yet the UK advocates a ‘nuclear renaissance’, promoting the most ambitious new nuclear construction programme in Western Europe.Here,this paper poses a simple yet quite fundamental question: what are the particular divergent conditions most strongly implicated in the contrasting developments in these two countries. With nuclear playing such an iconic role in historical discussions over technological continuity and transformation, answering this may assist in wider understandings of sociotechnical incumbency and discontinuity in the burgeoning field of‘sustainability transitions’. To this end, an ‘abductive’ approach is taken: deploying nine potentially relevant criteria for understanding the different directions pursued in Germany and the UK. Together constituted by 30 parameters spanning literatures related to socio-technical regimes in general as well as nuclear technology in particular, the criteria are divided into those that are ‘internal’ and ‘external’ to the ‘focal regime configuration’ of nuclear power and associated ‘challenger technologies’ like renewables. It is ‘internal’ criteria that are emphasised in conventional sociotechnical regime theory, with ‘external’ criteria relatively less well explored. Asking under each criterion whether attempted discontinuation of nuclear power would be more likely in Germany or the UK, a clear picture emerges. ‘Internal’ criteria suggest attempted nuclear discontinuation should be more likely in the UK than in Germany– the reverse of what is occurring. ‘External’ criteria are more aligned with observed dynamics –especially those relating to military nuclear commitments and broader ‘qualities of democracy’. Despite many differences of framing concerning exactly what constitutes ‘democracy’, a rich political science literature on this point is unanimous in characterising Germany more positively than the UK. Although based only on a single case,a potentially important question is nonetheless raised as to whether sociotechnical regime theory might usefully give greater attention to the general importance of various aspects of democracy in constituting conditions for significant technological discontinuities and transformations. If so, the policy implications are significant. A number of important areas are identified for future research, including the roles of diverse understandings and specific aspects of democracy and the particular relevance of military nuclear commitments– whose under-discussion in civil nuclear policy literatures raises its own questions of democratic accountability

    Advanced Computational Biology Methods Identify Molecular Switches for Malignancy in an EGF Mouse Model of Liver Cancer

    Get PDF
    The molecular causes by which the epidermal growth factor receptor tyrosine kinase induces malignant transformation are largely unknown. To better understand EGFs' transforming capacity whole genome scans were applied to a transgenic mouse model of liver cancer and subjected to advanced methods of computational analysis to construct de novo gene regulatory networks based on a combination of sequence analysis and entrained graph-topological algorithms. Here we identified transcription factors, processes, key nodes and molecules to connect as yet unknown interacting partners at the level of protein-DNA interaction. Many of those could be confirmed by electromobility band shift assay at recognition sites of gene specific promoters and by western blotting of nuclear proteins. A novel cellular regulatory circuitry could therefore be proposed that connects cell cycle regulated genes with components of the EGF signaling pathway. Promoter analysis of differentially expressed genes suggested the majority of regulated transcription factors to display specificity to either the pre-tumor or the tumor state. Subsequent search for signal transduction key nodes upstream of the identified transcription factors and their targets suggested the insulin-like growth factor pathway to render the tumor cells independent of EGF receptor activity. Notably, expression of IGF2 in addition to many components of this pathway was highly upregulated in tumors. Together, we propose a switch in autocrine signaling to foster tumor growth that was initially triggered by EGF and demonstrate the knowledge gain form promoter analysis combined with upstream key node identification

    Using systems medicine to identify a therapeutic agent with potential for repurposing in inflammatory bowel disease

    Get PDF
    ObjectiveInflammatory bowel diseases cause significant morbidity and mortality. Aberrant NF-κB signalling is strongly associated with these conditions, and several established drugs influence the NF-κB signalling network to exert their effect. This study aimed to identify drugs which alter NF-κB signalling and may be repositioned for use in inflammatory bowel disease.DesignThe SysmedIBD consortium established a novel drug-repurposing pipeline based on a combination of in-silico drug discovery and biological assays targeted at demonstrating an impact on NF-kappaB signalling, and a murine model of IBD.ResultsThe drug discovery algorithm identified several drugs already established in IBD, including corticosteroids. The highest-ranked drug was the macrolide antibiotic Clarithromycin, which has previously been reported to have anti-inflammatory effects in aseptic conditions. Clarithromycin's effects were validated in several experiments: it influenced NF-κB mediated transcription in murine peritoneal macrophages and intestinal enteroids; it suppressed NF-κB protein shuttling in murine reporter enteroids; it suppressed NF-κB (p65) DNA binding in the small intestine of mice exposed to LPS, and it reduced the severity of dextran sulphate sodium-induced colitis in C57BL/6 mice. Clarithromycin also suppressed NF-κB (p65) nuclear translocation in human intestinal enteroids.ConclusionsThese findings demonstrate that in-silico drug repositioning algorithms can viably be allied to laboratory validation assays in the context of inflammatory bowel disease; and that further clinical assessment of clarithromycin in the management of inflammatory bowel disease is required

    Molecular mechanistic associations of human diseases

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The study of relationships between human diseases provides new possibilities for biomedical research. Recent achievements on human genetic diseases have stimulated interest to derive methods to identify disease associations in order to gain further insight into the network of human diseases and to predict disease genes.</p> <p>Results</p> <p>Using about 10000 manually collected causal disease/gene associations, we developed a statistical approach to infer meaningful associations between human morbidities. The derived method clustered cardiometabolic and endocrine disorders, immune system-related diseases, solid tissue neoplasms and neurodegenerative pathologies into prominent disease groups. Analysis of biological functions confirmed characteristic features of corresponding disease clusters. Inference of disease associations was further employed as a starting point for prediction of disease genes. Efforts were made to underpin the validity of results by relevant literature evidence. Interestingly, many inferred disease relationships correspond to known clinical associations and comorbidities, and several predicted disease genes were subjects of therapeutic target research.</p> <p>Conclusions</p> <p>Causal molecular mechanisms present a unifying principle to derive methods for disease classification, analysis of clinical disorder associations, and prediction of disease genes. According to the definition of causal disease genes applied in this study, these results are not restricted to genetic disease/gene relationships. This may be particularly useful for the study of long-term or chronic illnesses, where pathological derangement due to environmental or as part of sequel conditions is of importance and may not be fully explained by genetic background.</p

    Multiple novel prostate cancer susceptibility signals identified by fine-mapping of known risk loci among Europeans

    Get PDF
    Genome-wide association studies (GWAS) have identified numerous common prostate cancer (PrCa) susceptibility loci. We have fine-mapped 64 GWAS regions known at the conclusion of the iCOGS study using large-scale genotyping and imputation in 25 723 PrCa cases and 26 274 controls of European ancestry. We detected evidence for multiple independent signals at 16 regions, 12 of which contained additional newly identified significant associations. A single signal comprising a spectrum of correlated variation was observed at 39 regions; 35 of which are now described by a novel more significantly associated lead SNP, while the originally reported variant remained as the lead SNP only in 4 regions. We also confirmed two association signals in Europeans that had been previously reported only in East-Asian GWAS. Based on statistical evidence and linkage disequilibrium (LD) structure, we have curated and narrowed down the list of the most likely candidate causal variants for each region. Functional annotation using data from ENCODE filtered for PrCa cell lines and eQTL analysis demonstrated significant enrichment for overlap with bio-features within this set. By incorporating the novel risk variants identified here alongside the refined data for existing association signals, we estimate that these loci now explain ∼38.9% of the familial relative risk of PrCa, an 8.9% improvement over the previously reported GWAS tag SNPs. This suggests that a significant fraction of the heritability of PrCa may have been hidden during the discovery phase of GWAS, in particular due to the presence of multiple independent signals within the same regio
    corecore